Biomedical Text Mining: State-of-the-Art, Open Problems and Future Challenges
نویسندگان
چکیده
Text is a very important type of data within the biomedical domain. For example, patient records contain large amounts of text which has been entered in a non-standardized format, consequently posing a lot of challenges to processing of such data. For the clinical doctor the written text in the medical findings is still the basis for decision making – neither images nor multimedia data. However, the steadily increasing volumes of unstructured information need machine learning approaches for data mining, i.e. text mining. This paper provides a short, concise overview of some selected text mining methods, focusing on statistical methods, i.e. Latent Semantic Analysis, Probabilistic Latent Semantic Analysis, Latent Dirichlet Allocation, Hierarchical Latent Dirichlet Allocation, Principal Component Analysis, and Support Vector Machines, along with some examples from the biomedical domain. Finally, we provide some open problems and future challenges, particularly from the clinical domain, that we expect to stimulate future research.
منابع مشابه
Frontiers of biomedical text mining: current progress
It is now almost 15 years since the publication of the first paper on text mining in the genomics domain, and decades since the first paper on text mining in the medical domain. Enormous progress has been made in the areas of information retrieval, evaluation methodologies and resource construction. Some problems, such as abbreviation-handling, can essentially be considered solved problems, and...
متن کاملSample Chapter LNCS 8401: Interactive Knowledge Discovery and Data Mining: State-of-the-Art and Future Challenges in Biomedical Informatics
One of the grand challenges in our networked world are the large, complex, multi-dimensional and weakly structured data sets (big data) and the masses of unstructured information. These increasingly enormous amounts of data require new, efficient and user-friendly solutions for knowledge discovery. This challenge is most evident in the field of Biomedicine: the trend towards personalized medici...
متن کاملText Mining in Bioinformatics: Research and Application
Biomedical literatures have been increased at the exponential rate. To find the useful and needed information from such a huge data set is a daunting task for users. Text mining is a powerful tool to solve this problem. In this paper, we surveyed on text mining in Bioinformatics with emphasis on applications of text mining for bioinformatics. In this paper, the main research directions of text ...
متن کاملBioTxtM 2016 Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining
Methods based on deep learning approaches have recently achieved state-of-the-art performance in a range of machine learning tasks and are increasingly applied to natural language processing (NLP). Despite strong results in various established NLP tasks involving general domain texts, there is only limited work applying these models to biomedical NLP. In this paper, we consider a Convolutional ...
متن کاملClosed- and Open-loop Deep Brain Stimulation: Methods, Challenges, Current and Future Aspects
Deep brain stimulation (DBS) is known as the most effective technique in the treatment of neurodegenerative diseases, especially Parkinson disease (PD) and epilepsy. Relative healing and effective control of disease symptoms are the most significant reasons for the tangible tendency in use and development of this technology. Nevertheless, more cellular and molecular investigations are required ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014